#synthetic data augmentation
04/07/2025
Crome: Google DeepMind's Causal Framework Enhances Reward Modeling for Safer LLM Alignment
Google DeepMind and collaborators introduce Crome, a causal framework that makes reward models used in LLM alignment more robust, using counterfactual data augmentation to counter reward hacking.
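
To make the idea of counterfactual data augmentation for reward modeling concrete, here is a minimal sketch. It is not Crome's actual procedure, which this summary does not describe. It assumes a standard pairwise (Bradley-Terry) preference loss, and the helpers `toy_reward`, `pad_with_filler`, and `augment` are hypothetical names invented for this illustration. The toy reward model scores responses by length alone, which is one plausible kind of shortcut behind reward hacking. The augmented pair changes only that spurious attribute while keeping the preference label fixed.

```python
import math


def bradley_terry_loss(r_chosen: float, r_rejected: float) -> float:
    """Negative log-likelihood that the chosen response beats the rejected one."""
    margin = r_chosen - r_rejected
    # -log(sigmoid(margin)) == log(1 + exp(-margin)), in a numerically stable form.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


def toy_reward(response: str) -> float:
    """Hypothetical flawed reward model: it scores by word count only."""
    return 0.1 * len(response.split())


def pad_with_filler(response: str, n_words: int = 20) -> str:
    """Hypothetical spurious edit: adds length without changing the answer."""
    return response + " " + " ".join(["indeed"] * n_words)


def augment(pair: tuple[str, str]) -> list[tuple[str, str]]:
    """Return the original pair plus one counterfactual pair.

    The counterfactual pads the rejected response, so length now favours the
    worse answer while the preference label (chosen beats rejected) is unchanged.
    """
    chosen, rejected = pair
    return [(chosen, rejected), (chosen, pad_with_filler(rejected))]


if __name__ == "__main__":
    pair = ("Paris is the capital of France.", "It is Lyon.")
    for chosen, rejected in augment(pair):
        loss = bradley_terry_loss(toy_reward(chosen), toy_reward(rejected))
        print(f"rejected has {len(rejected.split())} words, loss = {loss:.3f}")
```

Under these assumptions, the length-only reward model incurs a higher loss on the padded pair than on the original, so training on such pairs would penalise reliance on length. A real pipeline would generate the counterfactual edits with an LLM and train a learned reward model rather than a fixed scoring function.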